Energy-Efficient Approximate Edge Inference Systems

نویسندگان

چکیده

The rapid proliferation of the Internet Things and dramatic resurgence artificial intelligence based application workloads have led to immense interest in performing inference on energy-constrained edge devices. Approximate computing (a design paradigm that trades off a small degradation quality for disproportionate energy savings) is promising technique enable energy-efficient at edge. This article introduces concept an approximate system ( AxIS ) proposes systematic methodology perform joint approximations between different subsystems deep neural network (DNN)-based system, leading significant benefits compared approximating individual isolation. We use smart camera executes various DNN-based image classification object detection applications illustrate how sensor, memory, compute, communication can all be approximated synergistically. demonstrate our proposed using two variants system: (a) Cam Edge , where DNN executed locally device, (b) Cloud device sends captured remote cloud server DNN. prototyped such Intel Stratix IV GX-based Terasic TR4-230 FPGA development board. Experimental results obtained six large DNNs four compact running savings (≈ 1.6× -4.7× ≈ 1.5× -3.6× DNNs), minimal (<1%) loss application-level quality. Furthermore, exhibit -5.2× similar loss. Compared single subsystem isolation, achieves 1.05× -3.25× gains 1.35× -4.2× average,

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Approximate computing for energy-efficient error-resilient multimedia systems

The rapid advancement in scaled silicon technology has resulted in the influx of numerous consumer devices with a plethora of applications. Multimedia applications which use image and video processing, pattern or facial recognition, data mining and synthesis have seen a significant increase in user base. These applications not only demand complex signal processing of digital data to achieve qua...

متن کامل

FastInf: An Efficient Approximate Inference Library

The FastInf C++ library is designed to perform memory and time efficient approximate inference in large-scale discrete undirected graphical models. The focus of the library is propagation based approximate inference methods, ranging from the basic loopy belief propagation algorithm to propagation based on convex free energies. Various message scheduling schemes that improve on the standard sync...

متن کامل

Approximate Inference of Bayesian Networks through Edge Deletion

In this paper, we introduce two new algorithms for approximate inference of Bayesian networks that use edge deletion techniques. The first reduces a network to its maximal weight spanning tree using the Kullback-Leibler information divergence as edge weights, and then runs Pearl’s algorithm on the resulting tree for linear-time inference. The second algorithm deletes edges from the triangulated...

متن کامل

Energy-Efficient Approximate Computation in Topaz

We present Topaz, a new task-based language for computations thatexecute on approximate computing platforms that may occasion-ally produce arbitrarily inaccurate results. The Topaz implementa-tion maps approximate tasks onto the approximate machine and in-tegrates the approximate results into the main computation, deploy-ing a novel outlier detection and reliable reexecution...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: ACM Transactions in Embedded Computing Systems

سال: 2023

ISSN: ['1539-9087', '1558-3465']

DOI: https://doi.org/10.1145/3589766